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Abstract 

The analysis of Tables of particle properties shows that the probability distribution 
of the results of physical measurements is far from the conventional Gaussian p(^) = 
exp(-^V2), but is more likely to follow the simple exponential law p{£^) = exp{—^) 
is the deviation of the measured from the true value in units of the presented standard 
error). A gap between the expected and actual probabilities grows with ^ very rapidly, 
amounting to 10^ at ^ ~ 6, and is significant even at ^ = 2. A more detailed study 
reveals the two-component structure of the distribution: the exp{—^) law is closely 
fulfilled up to = 3, but then, at ^ larger than that, the decrease is retarded drastically. 
This behaviour can be associated with the existence of two various types of systematic 
errors, the detected and undetected ones. Within some model, both types of errors are 
seen to affect the form of the distribution, one at moderate ^ and the other at large 
^. The first type (detected) errors are shown in some natural-looking assumptions to 
yield the distribution not quite equal but close to the simple exponential. 



1 Introduction 

It is a usual practice in experimental physics to estimate the probability for some measured 
quantity having a certain value with the use of Gaussian probability distribution. Namely, 
if the measured value is A± A, then the probability that the true value lies in the interval 
(a, a + da) is commonly found by the formula (Gaussian standard law): 

da -i^^ 
^^(«) = -/|^e 2A2 . (1) 



The reason for doing so is supposedly provided by the known central limit theorem 
asserting the validity of Gaussian law (|l|) for any random variable satisfying certain rather 
general requirements (see e.g. |]I|). Surely, the fulfilment of these requirements as a rule is 
not analysed in detail, merely postulated when there are no special reasons for doubt; but 
the Gaussian law, whether justified or not, is extensively used and apparently borne out 
experimentally in many domains of statistics. 

A characteristic feature of the Gaussian distribution is very rapid fall-off of the probability 
with the departure from the most probable value. Defining the normalized deviation 

5 = ^ (2) 



and the integrated probabihty from (|l]) 




($ is the known probabihty integral), we have e.g. w{A) ~ 10^^, w{5) ~ 10~^, w{6) ~ 10~^, 
etc. These numbers are too small to be taken seriously, and they have hardly been ever put to 
a severe experimental test; one can only argue to have checked the Gaussian distribution in 
the region of moderate deviations (^<4) where probabilities are measurable. Still Gaussian 
estimates are widely used also in that faraway region (^>6), evidently for the lack of a better 
method. 

In particle physics, however, the measurement results deviating from exact values by as 
much as 6 errors or more are anything but an extreme rarity, their probability has in fact 
nothing to do with such tiny numbers as 10~^ and amounts to a quantity of order 10~^ (as 
estimated from the total number of data ~ 10^ in Tables of particle properties). This huge 
discrepancy (about 7 orders), taken literally, is not easy to eliminate within the Gaussian-like 
behaving distribution functions. A suggestion had been put forward (hardly in earnest), and 
gained popularity, that a true estimate of probability should be obtained with substituting 
for the dispersion A in eq. (|l|) an experimental error multiplied by 3, since 

expl —- -6 ~ 10 exp \ — - 




The scale factor 3 for the error looks unreasonably big, but regardless of its exact value, 
this primitive approach appears unsatisfactory because of overstating the significance of a 
few wrong or inaccurate measurements with large ^ in the bulk of much better experimental 
data. Large deviations are obviously due to systematic errors which plague many modern 
ingeniously designed complex experiments and are difficult to estimate or even to unveil. 
As no general method of doing this is accepted and every experimentalist uses here his own 
discretion, there is little reason to believe a priori in Gaussian or some other form of the 
distribution in question, it is moreover unclear even whether such distribution of any kind 
should exist at all. 

In 1973 the author did the work intended to learn the probability distribution of 
measurement results with ^ in experimental way, by closer examination of Tables of particle 
properties. It was found that the actual distribution (with the understanding that it does 
exist) showed the behaviour quite different from Gaussian, decreasing with ^ much more 
slowly. Remarkably, the form of the distribution appeared to agree fairly well with the 
simple formula 

MO = e-«, (4) 

or, in terms of eq.(0), 

da 

dw{a) = — e A . (5) 

Noteworthy is the absence of a possible number coefficient in front of ^ in the exponent, i.e. 
this coefficient proved to equal 1 with reasonable accuracy. 

In a related work by M.Roos et al. [0] the problem was treated somewhat differently. 
These authors also detected large departure of the real data distribution from Gaussian but 
gave preference to a power-behaving (Student-like) fit involving two free parameters (see eq. 
(|T3D of Sec. 2). While looking less attractive than our eq. (HI), it provided better conformity 
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with the actual distribution; however, the arguments to its favour are not convincing and do 
not allow one to decide between this and some other fit of a similar kind. The fact is that 
the form of the fitting function is not specified by a theory and can be chosen more or less 
arbitrarily. The present work in its major part is aimed at finding this functional form from 
some probable-looking model assumptions. 

The attempts to explain the non-Gaussian behaviour of data distribution made in both 
works 0, ^, though different in resulting formulae, were conceptually similar and based on 
the idea of taking into account the inevitable uncertainty in the experimentally determined 
errors. The ambiguity of the result is a consequence of essential dependence on the assumed 
form of the unknown probability distribution for the errors. Here is the weak point of this 
approach: the effect of the error uncertainty, surely existing and significant as it is, escapes 
solid quantitative estimates and can provide only a crude explanation of the Gaussian law 
violation, defying accurate experimental tests. 

It will be seen however that there exists another, in a sense competitive effect which is 
more liable to evaluation and may appear dominant at some ^. It can be designated the 
variable dispersion effect and is operative even with fixed and exactly known mean squared 
error, provided the dispersion of the measuring device is not constant but depends on the 
value of the measured quantity. While the first effect (the error uncertainty) is caused mainly 
by the systematic errors overlooked or radically underestimated by the experimentalists, the 
second effect can be attributed to the influence of those systematic errors which are detected 
and taken properly into account, but nevertheless induce a distortion of Gaussian law in 
default of compliance with the conditions of central limit theorem. 

In this work the experimental data probability distribution is reexamined on the basis of 
a later and more abundant statistical material. It turns out that the distribution exhibits 
a well-marked two-component structure characteristic of the superposition of two functions 
with different behaviour (see fig. 2, curve 1 to be discussed below): one fall-off regime is 
changed rather abruptly by the other near the point ^ = 3. It seems natural to associate these 
two regimes with two various mechanisms whose contributions have different ^ dependence 
and become comparable at ^ 3. The right-hand portion of the distribution curve (^ > 
3) which was not quite clearly visible with the earlier poor statistics may well have been 
produced by the effect of error uncertainty, while the left-hand portion following closely the 
simple-exponential law of eq. (^) could be shaped with the variable dispersion effect which 
will be seen to yield a distribution of a similar form. Of particular interest is the fact that 
the unit coefficient in the exponent gets a natural explanation in this scheme. 

The paper is organized as follows. Sec. 2 contains an account of the method and results 
of experimental determination of the data probability distribution with the use of Tables 
of particle properties. The remaining part of the paper is concerned with the theoretical 
interpretation of the obtained distribution. In Sec. 3 the effect of the error uncertainty is 
considered. Sec. 4 is devoted to a cursory discussion of the central limit theorem and a 
possible way of its expansion. In Sec. 5 which is the main in the work, a model of the 
variable dispersion effect is developed and analysed. It is shown that under some rather 
general assumptions the asymptotic form of the distribution at ^ 1 must be ^~^exp{—^). 
The last Sec. 6 contains a short review of the results and some concluding remarks. 
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2 Experimental distribution 



For experimental evaluation of the data probability distribution all which is needed is a 
sufficiently abundant set of measurements of some quantity with the exactly known true 
value. Of course, the true value is normally unknown during the measurement, but we can 
restrict our consideration to the old experiments for which the later, much more accurate 
results are now available to be taken for the exact values. As for the number of measurements, 
although it is hardly ever large enough when dealing with only one definite physical quantity, 
it can well be made statistically significant by using the hypothesis of universality of the 
distribution, i.e. assuming the probability w{C,) to depend only on C, and nothing else. The 
probability density will then agree with the formulae 

dw{a) = (6) 

p{0 = -w'{0, (7) 

generalizing eqs. (|l|) and (^. This hypothesis, if true (which point will be returned to below), 
allows one to take into account all measurements with the same ^ on equal terms, lumping 
together the results of quite dissimilar experiments, irrespective of the quantity measured 
or the technique employed. Note that we suppose the distribution to be symmetric about 
its centre, since ^ defined by eq. (0) is an even function of {a — A), so we should avoid 
consideration of measurements with manifestly asymmetric errors. 

The distribution function w{$,) designating the probability for the measurement result to 
deviate from the true quantity by ^ or more standard errors, can thus be found from any set 
of data by the formula 



where N is the total number of data in the set (supposed to be very large), a, is the exact 
value of the quantity measured in i-th experiment, Ai and Aj are its measured value and the 
error, and 6{x) denotes the Heavyside step function (1 for x > and for x < 0). When 
using Tables of particle properties as a data source, Ai and Aj are suggested to be taken 
from an earlier ('old') Table, Oj from the latest ('new') Table. So in practice the values of a, 
are themselves not quite exact but approximate with the new errors 6i, and can be treated 
as nearly exact only when a new error 6i is small compared to the corresponding old error 
Aj, which is not always the case. The influence of uncertainty of can be suppressed by 
including in the set only the data with sufficiently large ratio Ai/Sf, in what follows a rather 
liberal data cutoff is used 

^>2.5. (9) 

(In changing the cutoff from 2.5 to 4 was found to reduce the number of data N by about 
10% with quite negligible effect on the distribution). 

The described method of finding the function w{^) was applied ffist in 1973 0] to the 
Tables of 1964 Q and now once again to the Tables of 1978 (the old data), with 'exact' 
values extracted from the new Tables of 1972 ^ and 1994 0| respectively. Apart from the 
restriction (^, the procedure of data selection included yet some more criteria. The following 
data types were rejected: 

— the results treated by the compilers as unreliable or preliminary (enclosed in paren- 
theses) ; 
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— the data taken from review articles (marked by the label RVUE); 

— the results of measurements of essentially positive (by physical meaning) quantities 
when the error exceeded half the measured value; 

— the data with asymmetric errors of the form Af^l when Ai and A2 differed by more 
than 10% (otherwise the result was accepted with the error taken to be symmetric and equal 
to the greater of Ai and A2); 

— all data relating to wide resonances, F > 120 MeV. In p|, moreover, no resonance 
data at all were considered, only those relevant to strong-interaction stable particles. 

We are not dwelling on the discussion of these constraints, aimed mainly at lowering the 
contribution of dubious or too uncertain results and of some temporal changes in general 
concepts (e.g. in the common definition of the mass and width of very wide resonances). 
One may argue if the variation (strengthening or loosening) of formulated criteria should 
really have no effect but the change of the total normalization; we shall touch on this point 
once more in the next section. 

It must be emphasized that the total statistical sample of 933 data examined in this work 
includes all measurement results contained in the Table |^, irrespective of the quantity taken, 
except only for those rejected by the above-listed restrictions. Its inspection shows that it 
comprises four groups of measurements having comparable weights (classified by the quantity 
that was measured): measurements of particle masses (223 data, or 24%), lifetimes or widths 
(196 data, or 21%), branching ratios of various decay modes (316 data, or 34%), and the 
last group (198 data, or 21%) combines all other particle characteristics (magnetic moments, 
form factors, parameters of angular and energy distributions of decay products) as well as 
some interaction constants {gA/dVi parameters of CP- nonconservation etc.). No account 
was taken of possible correlation between various quantities obtained from one experiment, 
they were taken to be independent if the compilers used them in statistical averaging. The 
number of such many-data experiments does not seem to be very small, this effect may 
cause some distortion and should be kept in mind. Another analogous interfering factor 
could be the mutual influence and correlation between the results of various experimental 
groups which are ideally viewed as quite independent. The significance of these effects is 
difficult to consider quantitatively, but one may hope it would be true to ignore them in the 
first approximation. 

The results of the described treatment of Particle Data Tables are shown in figs. 1 and 2 
(the curves labelled 1). Fig. 1 reproduces the earlier finding and is given here for conve- 
nience of comparison; the newly obtained distribution of data from |^ is presented in fig. 2. 
The difference is seen to be modest and can well be assigned to damping of statistical fluctu- 
ations with growth of the total statistics from 209 to 933; however, it should be noted that 
there is a certain systematic excess of the distribution in fig. 2 over that in fig. 1 which may 
appear to be evidence against the above-formulated universality. Here we shall not discuss 
this point in much detail, but turn now to the other, much more pronounced features of the 
resulting distribution. 

First of all, the real distribution is apparent to differ radically from Gaussian (curve 2 in 
figs. 1 and 2), especially at large ^, the fact already noted above. The difference grows quickly 
with ^ and ranges up to many orders at ^>6, but is rather large even for moderate values 
of ^ (~ 3 times at ^ = 2, ~ 20 times at ^ = 3). It is clear that Gaussian estimates highly 
underrate the deviation probabilities nearly everywhere in ^ and are entirely misleading. 

An essential fact here is that the real distribution curve 1 not only passes much higher 
than the Gaussian curve 2, but moreover exhibits a distinctly different behaviour: the 
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two curves have the opposite convexity (in log scale), i.e. the real probability decay rate 
d\lnw\/d^ decreases with ^, while by Gaussian law it should be growing ~ ^. It follows 
that there is little point in modifying Gaussian law (|l]) with only renormalizing the error A 
by any constant factor, as this procedure leads to nothing more than stretching the curve 
along the ^ axis with no effect on convexity. The curves 4 and 5 in fig. 1 show that dou- 
bling, the more so tripling the error (i.e. replacing ^ in eq. (^ by .^/2 or ^/3) results in 
grossly overestimating the deviation probabilities against their experimental values in the 
region of moderate ^. Clearly, the agreement cannot be achieved by any choice of the error 
renormalization factor and the Gaussian-like approach itself does not work. 

It is natural to try for the experimental distribution the linear-exponential approximation 
(^ (straight line 3 in figs. 1 and 2) which appears to fit rather well, better than one could 
expect without any prerequisite. Specifically, a fact deserving consideration is the unit 
slope of the fitting curve, which corresponds to the above-mentioned unit coefficient in the 
exponent of eq. (H). Of course, it may well be a mere coincidence, but it is suggestive to see 
here a regular phenomenon and to search for its origin. 

Closer inspection of fig. 2 shows that the linear-exponential fit (^ being very good at 
< 3 breaks down at large ^ where the real probability curve 1 displays essentially slower 
fall-off. It seems that the point ^ = 3 marks the region where the fall-off regime undergoes a 
change closely resembling that of the decay curve of a two-component radioactive substance. 
A limited statistics (65 measurements with ^ > 3 of the total 933) does not allow one to 
recognize the functional form of the decrease at large ^, but its slowing down is seen clearly 
enough, in distinction to fig. 1 where the statistics was too meagre for that (9 measurements 
with ^ > 3 of the total 209). As a simple illustrative example, the function with a similar 
behaviour 

is displayed in fig. 2 (curve 4). The value of the constant C is chosen so that the two terms 
in the numerator were equal at the breakpoint ^ = 3, the denominator serves to ensure the 
normalization condition ^ 

^i(o) = / pmi = i. (11) 

Of course, this function cannot represent the real distribution at all ^ since it does not vanish 
at ^ oo, but can be considered as a reasonable approximation for moderate ^ when the 
'long-lived' component decays very slowly and its change is inappreciable in the range of ^ 
under study. 

It should be noted that both approximations (H) and (|10|) are expected to fail also at 
small because they do not satisfy the condition 

= (12) 

which must be fulfilled if the probability density p(^) is even in and analytic at the point 
,^ = 0. In fact at small the distribution follows Gaussian rather than linear-exponential 
law, which can be ascertained by viewing fig. 2 at some magnification; the difference is slight 
but unambiguous in favour of Gaussian behaviour. 

Before proceeding to the interpretation of the indicated features of the experimental data 
probability distribution, it will be pertinent to touch briefly on the work concerned with 
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Figure 1: 

Q — experimental distribution of 209 measurement results, ref . [0 ; (2) — Gaussian standard 
law, eq. (|); (3) — simple exponential law, eq. — Gaussian distribution with doubled 

and tripled dispersion. 




Figure 2: 

(T) — new experimental distribution of 993 measurement results from ref. 0; (2), (3) — same 
as in fig. 1; — modified exponential distribution, eq. (|10|) . 
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similar problems. Its authors had a well-defined practical aim — to develop a sound proce- 
dure for averaging together the poorly reconcilable measurement results obtained by various 
experimental groups, and doing this required information on the real data distribution. To 
get it, they used a procedure differing from ours in some respects. First, they made efforts to 
possibly lower the infiuence of systematic errors and tried to select only those measurements 
which would not be subject to them. Second, they had to do with only one table (the latest 
one) and no later tables, so in place of the true value a in eq. (0) they used a weighted mean 
calculated by averaging the related measurement results from the same table. Thus their 
distribution does not quite coincide with ours and some difference would appear natural. It 
is worth noting that they did not merely postulate the universality condition (|^ but put 
it to some check, separating the total sample of 306 measurements into several groups and 
comparing the distributions in various groups; all of them proved to be essentially the same 
within the probable statistical fluctuations. 

In flg. 3 curve 1 shows the total data distribution from |0 represented in the form suitable 
for comparison with flgs. 1 and 2 (in a histogram for the probability density p(^) was 
given; here it is recast into the integral distribution w{^) with some interpolation). It can 
be observed to bear certain similarities to the distribution in flg. 2: a remarkable excess 
over the Gaussian distribution (curve 2), an approximately linear-exponential decrease at 
intermediate ^, and bendings (decay slowing down) both at small and large ^ (the latter being 
not quite clearly seen probably owing to deflcient statistics, only 1.5 times that in flg. 1). 
There are also clear-cut differences: the large ^ bending (if any) occurs later in ^, and in the 
linear-exponential region the slope is about 1.3 instead of unity (i.e. w{^) ~ exp {—1.3 ^) ), 
so approximation (H) (curve 3) is here not good. On the whole, the distribution looks as a 
kind of interpolation between the Gaussian law and something like eq. (|TD|) (curves 2 and 
4 in flg. 2). 

For a fltting function to describe the obtained distribution, there was taken in a slight 
modiflcation of Student's formula 




p(0 = K 1 + ^ (13) 



containing two free parameters, n and c (normalization constant K is deflned from eq. (|TT])) 



With c = 1 and integer n, eq. (|T3|) represents the probabihty density of a random quantity 



n 



n+1 
i=l 



n + 1 



\ 




(14) 



which is a function of (n + 1) random variables Xi all having identical normal distributions 
with zero mean value (or alternatively they can be viewed as the results of {n+1) independent 
measurements of a random variable normally distributed about zero). A scaling parameter c 
is introduced to take into account a possible underestimate of the measurement error by the 
experimentalists just in the manner discussed in Introduction. The values of the parameters 
adequate to experimental distribution were found to be 

n = 10, c= 1.11. 
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Figure 3: 

(T) — distribution of 306 measurement results from ref. 0; (2), (3) — same as in fig. 1; 
modified Student's distribution, eq. (|T3D. 
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The corresponding integral distribution w{C,) is represented by curve 4 in fig. 3 which 
practically coincides with the experimental curve 1 nearly everywhere (except for large ^ 
where the statistics is scanty). Notwithstanding this coincidence, the form of eq. (|T3|) does 
not look convincing; below the arguments will be adduced for another dependence, more 
closely resembling (but not identical to) eq. (|TUp. 



3 The effect of the error uncertainty 

The most prominent feature of all distributions considered above (figs. 1-3) is their large 
excess over the Gaussian law predictions, and its explanation is natural to try to begin with 
taking into account the obvious fact that the experimental error is never measured exactly 
but may differ significantly from the true dispersion of the measuring device. Such approach 
was applied in and in a less explicit form in |0; here it will be reexamined in some detail. 
Let us consider a model based on the following postulates: 

1. Each measurement device or experimental setup is subject to fiuctuations producing 
at the output in place of the measured quantity a a random quantity x spread with some 
probability density p{x, a) which is an individual characteristic of a given device. In other 
words, when measuring a quantity a, one obtains the result confined between x and x + dx 
with the probability p{x, a) dx. The probability density p{x, a) is assumed to have a Gaussian 
form 

(x— a— s)^ 

p{x, a) = — e (^i^'j 
V27r cr 

with two parameters s (scale shift) and a (dispersion) characterizing a given device. 

2. There exists in the world a huge number M of devices distributed over the parameters 
s and a with the density P{s,o), so that the number of devices with the values of the 
parameters between (s, cr) and (s + ds,a + do) is dM = MP{s, o) ds da. 

3. The parameters s and a of any given device are not known exactly but measured with a 
limited accuracy and found, instead of (s, cr), to be in the range of (s', a') to {s' + ds', a' + da') 
with the probability q{s,(T\s' ,a')ds'da' . The probability density g(s, cr|s', cr') is supposed to 
be i) a function of the difference (s — s'), not of s and s' separately, and ii) a homogeneous 
function of its four arguments, i.e. 

q{s,a\s',a') = \ql- —] , (16) 

o"^ \ (7 a J 

where q{u, t) is dimensionless. 

In such model the experimental distribution of measurement results with the quantity 

e = ^^4^ (17) 

cr' 

will be 

^^J^^"^ = J P{s, cr)ds da J q{s,a\s' , a')ds'da' J dx p{x,a) 6^ ^ 

= P{s,a)dsda q{s,a\s\a')ds'da'^^e . (18) 



11 



Note that ^ as defined by eq. ([T7|) slightly differs from that of eq. (H) and is no more a positive 
definite quantity; to come back to former definition in eq. (^5[) and analogous formulae below 
a symmetrization with respect to — should be made. 
Using eq. (|1^ and introducing dimensionless variables 

s' — s a' , ^ 

u= , t = -, (19) 

a a 

we can separate the integral over ds da and take it in view of the normalization condition 

' P{s,a)dsda =1, (20) 



so, as might be expected, the distribution of devices P(s, o) drops out of the formula and 
eq. ([T8|) becomes 

p(0 = / rfudtg(M,t)te-^(^ + *^)'. (21) 
v27r J 

An essential input on the right of eq. (|2T1) is the function g(u, t) governing the accuracy of 
determination of the measurement error a', the resulting distribution being highly dependent 
on the suggested properties of this unknown function. Clearly, taking 

q{u,t)=5{u)5{t-l), (22) 

we arrive at Gaussian distribution (|^). Replacing 5{t— 1) by 5{t — k) (with k constant) gives 
the scaled Gaussian law discussed in Introduction. Any kind of smearing the (5-function 
yields generally some departure from normal distribution, and there is nothing to prevent 
from getting any reasonable expression for the result by choosing an appropriate form of 
q{u,t). In particular, putting 

g(n,t)=5(«)^e"#, (23) 

we get from eqs. (^) and (|^) 

MO = ^e-«v^, (24) 



which at a = 1 is just eq.(^). It can be remarked that if q{u,t) has the form 5{u)q{t) with 
q{t) smooth and vanishing both at t = and t — >■ oo, then the small ^ behaviour of w{^) is 
determined in general by the large ^ asymptotics of g(t), and conversely, the asymptotics of 
^(0 ^-t ^ oo depends on the behaviour of g(t) at t = 0. 

Among other possible expressions for the function g(u, t) of interest is that one which 
corresponds to the parameters c and a of a device being determined from a certain number 
n of independent measurements of some known quantity a. If the results of these calibrating 
measurements are Xi (z = 1, ... ra), then 

n \ ^ 

s=-^Xi-a, a'"^ = _ ^{xi - a- s'f, (25) 
^ i=i n I 

so, in view of the relation da'^ = 2a' da' and eq. (|15|), 

g(s, (t|s , cr ) = 2(T / — = — e 2cr^ ' ■ 

-I (v27rcr)" 

Ks' + ^i) ^ (xi - a - s'Y) (26) 

. ^"'~\-^\{n-lW + n{s'-sf] 
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where use is made of the relation 



n-l 

/ '<' '<' jY 2 n— 3 

d-x SiY^x,) 5(E -R)= , R— (27) 



i=i i=i ^/nT{^^) 



and the normahzation factor is dropped as inessential. It follows from eqs. |2g and ^ that 
in this model 

„ n 2 n-l J.2 

g(M,t) ~r-2g-2M - — t _ ^28) 
Inserting this into eq. (pi]), we obtain 

n 

+ \ (29) 

which is nearly the same as eq. (|I3|) . (The exact eq. ([13|) with c = 1 and n replaced by n — 1 
would result if the w-dependence in eq. (E^ were taken to be S{u) ). 



Eq. (^91) presents a typical appearence of the departure from normal distribution owing 
to error uncertainty induced by statistical reasons only, i.e. by insufficiently large number 
n of measurements. Obviously, at n oo the distribution becomes Gaussian, except for 
very large ^ > n^^^ where the correction ~ ^^/n becomes essential; at still larger ^ > y/n the 
quadratically-exponential fall-off is in fact changed by a power. In this model systematic 
errors are disregarded or, maybe one can say, simulated by something looking like statistical 
errors, which does not seem to be correct, and that probably manifests itself in the necessity 
of introducing a scaling parameter c into eq. (plSj) . 

The implication of the considered examples is that no definite conclusions regarding the 
probability distribution w{^) can be drawn from eq. (^TD without one or other assumption 
about the function q{u,t) which embodies information on the mechanism of generating the 
error uncertainty. However, there is yet another simple model worth discussing, based on 
the hypothesis that the primary source of the departure from normal distribution hides in 
the overlooked systematic errors. It is characterized by the function q{u, t) of the form 

q{u,t) = {6{u)+ij{u))6{t-l), (30) 

where fi{u) is some smooth function (an overall normalization coefficient is omitted). Con- 
trary to both models of 0] and 0], here the the device dispersion a (the spread of the 
distribution (refl5) ) appears to be measured with a good accuracy {a' = a due to the 
second 5-function and eq. (p!9|) ), while systematic errors show up in some indefinite shift s 
of the measurement scale and are refiected by the addition of ii{u) to 5{u)'m. eq. (|30|) . The 
additive form of the first multiplier corresponds to the assumption that all measurements 
can be divided into two separate groups of a comparable weight — the 'good' measurements 
which are free from systematic errors, and the 'bad' ones which are subject to them, these 
two groups being represented by two terms 5{u) and ii{u) respectively. Since the systematic 
errors are randomly distributed in their values without any visible correlation to a measured 
error (which they may exceed many times), the function ^{u) can be considered as nearly 
constant at w ~ 1, with some decrease at very large u rapid enough to provide the convergence 
of the normalization integral. So, the dependence on u in eq. (pOD looks as a sharp peak 
at M = rising above a smooth, almost uniformly smeared background. Clearly, the weight 
of the background is expected to vary with changing the criteria of the data selection, and 
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in this sense the above-discussed universahty of the distribution w(^) cannot exactly hold; 
the function fi{u) can be viewed to contain an uncertain numerical factor depending on the 
selection procedure. 

Inserting eq. (|30D into (0), one can perform the integration if the function fi{u) is slowly 
varying {li'iu) <^ 1), and get 

P(0^4=e~¥ +/i(-0, (31) 

since the Gaussian exponential of eq. (0) acts as a (5-function when integrated with yu(-u). 

The obtained formula is qualitatively similar to the above eq. ( p!OD — it represents a sum 
of two functions, one rapidly and the other slowly decreasing, and its plot must have a bend 
near the value of ^ at which the two terms become equal. One of the pronounced features of 
experimental distribution (curve 1 in fig. 2) thus finds rather simple and natural explanation 
in the influence of unrevealed systematic errors. There is however a remarkable difference 
in the first term between eqs. (^) and (p^O]). In the model just considered the distribution 
is distorted only at large ^ and remains nearly Gaussian to the left of the bending point, 
while the experimental curve 1 runs essentially above the Gaussian curve 2 even at ^ < 3. 
Moreover, had the first term of eq. (|T0|) Gaussian-like form, the right-hand nearly horizontal 
portion of the curve (at ^ > 3) would have to go much lower than it really does, since from 
eq. (H) w(3) ~ 0.003 instead of actual 0.05. This disparity may look not serious, as it can be 
easily eliminated by an appropriate smearing of the (5-functions in eq. (pOl) . However, such 
way of doing is not quite satisfactory because of its arbitrariness and indeterminacy, it leaves 
too much freedom in the choice of the function q{u,t) and does not allow to say anything 
definite about what the distribution must be like for theoretical reasons. Nor is it of any use 
in clarifying the origin of the unit coefficient in the exponent of eqs. (p!0D and (^) which is 
a pure accident in this scheme. Therefore it makes sense to look for some other sources of 
possible distribution deformations which could be effective at < 3 and maybe shed light 
on the latter problem. 



Eq. ( pTD can be easily extended to the case of the probability density p{x,a) in eq.(|T^) 
having a more general than Gaussian form 

p{x,a) = -p( , (32) 



a \ a 

then 



p{^) = J dudtq{u,t)tp{u + t^). (33) 



If the function p{y) is rapidly decreasing at large argument, the hypothesis of eq. ( pO]) 
yields a similar generalization of eq. (0): 

p(0^p(0 + M-0- (34) 

Again we see that at ^ not very large where the first term is dominant (the boundary value of 
^ depending on the selection criteria), the experimentally measured distribution duplicates 
the primary scatter caused by the device, and we come back to the problem of explaining 
its non-Gaussian character. So, in what follows we ignore the mesurement uncertainty of 
the function p{^), taking q{u,t) to be defined by eq. (p2D, and concentrate on examining a 
possible form of p{S,) itself, which is then the same as p(0- 
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4 The central limit theorem 



Normal distribution, be it justified or not, holds a specific position in statistics and is often 
favoured over other distributions in default of information. So, clearing up the question why 
should (or shouldn't) some probability distribution be normal, it is worthwhile to recall the 
central limit theorem which is in fact the only argument for that. A simple version of the 
central limit theorem (there are many of them, see e.g. [|1|) can be formulated as follows. 

Let xi, X2, ■■■Xn be n independent random quantities with distribution density of Xk being 
Pk{xk)- Let three lowest moments of each distribution pk exist: 

Mxfc = Xfc, M(xfc = cTfc, M\xk-Xk\^ = 4 (35) 

(here M/ denotes mean value of /). Construct a new variable y = J2k=i^k having the 
distribution density 



/n 
dxi...dxnp{xi)...p{xn) S{y - Xk) (36) 
k=l 



with the moments 



My = Y.Xk = y, M{y-yy = J2a^ 



2 _ „2 

k 



a 

k=l k=l 

n n 

MY,\xk-Xk? = Y.^l = t\ 

k=l k=l 



(37) 



and consider the limit n oo. The theorem states that if the requirement 

lim - = (38) 

n— >oo (J 

(Liapunov's condition) is satisfied, then, whatever the distributions Pk{xk) are, the distribu- 
tion p{y) tends to Gaussian with mean value y and dispersion cr: 



1 



The meaning of the condition (|38D, roughly speaking, is that it necessitates fluctuations of 
all Xk to make comparable contributions into fluctuations of y, preventing from only a small 
number of them being really effective. In a typical case with all ak of the same order, a ~ 
oin^/^, t ~ ain^/^, 

t/a ~ n~^^^ — > 0. (The overall factor oi itself is supposed to be ~ so that a 

tends to a constant a.t n oo). 

For logical coherency we sketch here an outline of the proof. Introducing Fourier trans- 
forms 

dxe'''''pk{x) = Mu), (40) 
Jdye^''yp{y) = ^u), (41) 



we have from eq. 



*(«) = n Mu), ln$(M) = J2\nMu). (42) 
fe=i fe=i 
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With eq. ( ^8|) being fulfilled, each at n — oo is shrinking towards its maximum aX x = Xk 
and the exponential in the integrand of eq. (^) can be expanded near that point to yield 



,2 



2 

Inserting eq. into (^) and using eqs. (0), we obtain 



0,H^e^^^(l-^c;f), (43) 

2 

In^u) ^iu^-\oi . (44) 



2 

ln$(M) ^ iuy - ^cr^. (45) 

An important point is that the cubic and all higher terms in the right-hand side vanish at 
— > cxD due to eq. ([381) . Exponentiating eq. (^5]) and making inverse Fourier transform of 
$(n) according to eq. (0), we get the Gaussian expression ( |5UD for p{y). 



It may be remarked that the statement of the theorem is true also in a more general case 
when y is not an exactly linear function of the variables x^- It is sufficient for this function 
to be smooth in the vicinity of the point Xk = Xk and exhibit essential changes only with 
the departure from this point at distances much greater than the dispersions cxfc, so that it 
could be well approximated by its linear expansion in differences {x^ — Xk) (the coefficients 
of this expansion can easily be renormalized to unity by redefinition of Xk)- 

Another possible way of extension of the theorem is associated with abandoning the 
condition of independence of random variables Xk- If they are correlated, then the product 
Pi{xi)...pn{xn) in the integrand of eq. (BBf) which is effectively proportional to 



1 {xk - Xk) 



2 



2^ -I 



is expected to be replaced by 



Tki {xk - Xk) {xi - xi) ^ j^^g^ 



where Tki is some non-diagonal positive definite constant n x n matrix (Liapunov's condition 
( p8D is supposed to be properly modified). It is clear that such replacement does not affect 
the form of the function p{y) which remains Gaussian. 

If now a random quantity has a distribution different from Gaussian, it would be well to 
conceive which of the conditions of the central limit theorem is violated. There are not many 
possibilities for that. First one, most obvious and suggesting itself, can be that the number 
n of effective disturbing factors is not large or only a few of them fluctuate signiflcantly. A 
typical example is when all dispersions ak are negligible in comparison with one di and the 
sum in the second eq. (|37D is dominated by one term cr^, so that all Xk except Xi can be 
considered as constants and the distribution of a quantity y coincides in essence with that 
of Xi up to some shift. If n is large but flnite, then the dropped cubic in u term in eq. ( ^Sf ) 
is generally of order n{aiu)^ ~ {au)^ / ^/n , and since from eq. (^) (or its inverse) 

y-y 

u (47) 

the Gaussian law is expected to break down at 

^ = ^>^i/6^ (48) 
a 
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where this term becomes of order or more than 1. (This condition is replaced by ^ > n^^^ 
with the analogous estimate of a quartic instead of cubic term in the case when the latter 
happens to vanish). It is seen that the number n must be extremely large to provide the 
validity of the standard law even only for < 4. 

Another natural possibility for a quantity y to have non-Gaussian distribution consists 
in the nonlinear dependence of y on the disturbing factors Xk- The usual way of reasoning 
is that if the number n of x^'s is large and each partial dispersion is small ~ cto"^/^, 
then any smooth function y{{xk}) varying at scales of Xfc ~ a can be linearized for small 
variations of Xk ~ 0"^. However, it is not true, which is seen from the fact that if y is normally 
distributed then z = f{y) is generally not. This paradox emerges from the invalidity of a 
linear approximation for the function y{{xk}) when it changes essentially only in a small 
number of directions in the multidimensional space of {xk}', in this case largeness of n is not 
a sufficient condition for the nonlinear terms to be negligible, in fact their total contribution 
is comparable in value with the linear part of the expansion. So, the nonlinearity may matter 
when the measured quantities are being chosen at random, without any consideration of their 
dependence on the disturbing factors. We shall not discuss this point in more detail since 
one is not led here to any definite conclusion about the character of infiuence of this effect on 
the resulting distribution; it can only be thought to weaken with averaging over a variety of 
dissimilar data, while in distributions of measurements of one or a few analogous quantities 
it may appear more important. (This is yet another possible reason for violation of the 
universality discussed in Sec. 2). 

There is a further way worth considering to get a distribution different from Gaussian, 
which will be given more attention in the next section; it is associated with the possibility 
that the variables x^ might be correlated without representation (EBf) for their combined 
probability to hold. It can be realized e.g. by supposing the matrix T in (^) to depend on 
the sum of x^s which can be not small in spite of smallness of XkS owing to large n. A reason 
for such assumption is provided by observation that the accuracy of a measuring device is 
mostly not constant but different in various parts of the scale, usually best in the middle 
and worsening to the extremities. This approach seems to have more definite consequences 
as to the resulting data distribution than the previous ones, which will be seen from a model 
considered below. 



5 The variable dispersion effect 

Let the total set of variables Xk determining the measurement result y consist of two subsets 
of n and ni variables respectively, the former n being entirely independent and the latter 
ni entangled together. Both numbers n and ni are supposed to be large enough so that all 
Pki^k) could be taken in the Gaussian form and thus 

n+m f / " 1 

p{y) = (271)-^ / n -e ^ ■ 

■J \ k=l J 

j <r^x{<leiTf'^e~ h^kiXkXi ^(^ _ "g' ^ ^49^ 

where T^iXkXi implies summation over k, I running from n + 1 to n + rii. The mean values 
Xk are put to be zero by making proper shifts of x^s. 
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Taking the inner integral over d'^^x, we obtain 

p(y) = (27r)-t |rf"x[n^e~^] gl^y-Y^x^Y (50) 



k=l ^« / \ k=l 



g{z) = -^e ^, (51) 



'27C (To 

= (52) 

kl 

Eq. (^TD is very similar to eq. (pB]) with Gaussian-like probabilities pk{xk)', an important 
difference is that (5-function in the integrand has been changed by Gaussian-like function 
g{z), eq. (^21). This expression looks as a natural generalization of eq. (^) for p{y) when ?/ 
is not exactly equal to the sum of Xk but is a random function whose values are normally 
distributed about this sum with the dispersion (Tq. Eq. (pl|) may be postulated as a hypothesis 



itself, without reference to eq. (49); it seems possible for it to have more general character 
and to be produced by some other underlying mechanisms. 

It is seen that the form of the matrix T^i in eq. (^9]) does not matter much, it might 
well be diagonal. A significant point in what follows is the assumption that this matrix 
and hence the width ctq of the peak of the function g{z) may depend on (J^^^iXk), so that 
further integration in eq. (|5ll) becomes nontrivial. This dependence is supposed to model 
the above-mentioned variation of the measurement dispersion within the scale of a device. It 
should be emphasized that the two groups of variables x^ {k = 1 to n and k = n + 1 to n + rii) 
play different parts in this model even in the case of a diagonal matrix T^i which is taken 
to depend only on the first group of x^s. A possible way of interpreting the second group 
of rii variables is to consider them as the remnants of the corrected systematic errors. The 
correction that an experimenter has to introduce when he detects some systematic effect, 
while supposed to be a constant, may in reality depend on the measured value, and this fact 
can be simulated by the specified variability of ctq- 

Now, to calculate p{y), let us make again a Fourier transform of eq. (|5l|) , as we did it 
earlier with eq. (^41): 



Hu) = J dye'-ypiy) = (27r)-t J d"x (fl -) 



J dh6[h-J2xkj e '^=^ ' , (53) 

where a new integration variable h = X]fe=i is introduced. Substituting for 5-function its 
Fourier representation, we can integrate over Xk and get 

$(«) = [dhe'&'-'''^-P'^^'\ (54) 



n 

E (55) 

k=l 



Performing now inverse Fourier transform of eq. (|MD, we obtain 

1 r dh 
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In the case of cro{h) = const this formula obviously gives the Gaussian expression for 



p{y) with total dispersion A = ya'^ + a^. If ao{h) is a slowly varying function, then the 
integral is nearly the same but with A depending on y via ao{h), where /i is a function of y 
determined from the saddle point equation 



Typically <7o{h) is a growing function (at h > 0), and since h is nearly proportional to y 
from eq. (p?]), A also grows with y, which means that the distribution fall-off is slower than 
by Gaussian law. 

It makes sense to consider more closely of a quadratic function cfQ^h): 

al{h)=al + ^{h-h^)\ 7>0 (58) 

(7,o"o,/io are some constants), which is in fact the simplest suitable form, since a linear 
function is not appropriate because of its sign reversal at some point. Eq. (|58| ) can be viewed 
not merely as a Taylor expansion of some smooth function at small deviations [h — ho), but 
more generally as a reasonable approximation for any function with a similar behaviour in 
a bounded region, i.e. having a minimum and growing progressively in both directions from 
this minimum. 

An important property of the quadratic function o"o(^) thai it yields the simple- 
exponential fall-off of the distribution density p{y) at asymptotically large y. Indeed, the 
exponent in the integrand of eq. ( |5^ ) with regard to eq. ( |5BD at ?/ — oo, — oo and h <^y 
(this will be seen to hold for the saddle point) turns into 



y 



2 



2(t2 27/^2 ■ 

This function of h has a sharp maximum aX h = 7~^''^a/o^ and its value at this point is 

-yl{(^^/i), so 



p{y)^e (y^oo) (59) 

with main exponential accuracy (without pre-exponential power factors). 

Another interesting consequence of the assumption ( ^8|) for alih) is the possibility to 
calculate explicitly from eq. (0) the Fourier transform of the distribution density: 

^{u) = ^=2=^e 2^+^'° + " 1+7.^-0 (60) 

V 1 + 7Cr M 

and hence the moments / p{y)y"^dy, which are in fact the coefficients of the Taylor expansion 
of $(u). In particular, for the dispersion A we have 

A' = / Piy)y'dy = -$"(0) = al + ^hl + a2(l + 7) . (61) 

It should be noted that eq. (|56| ) is not yet just the expression to be compared with the 
experimental data distributions in figs. 1-3, since it is supposed to refer to a certain device or 
setup, while experimental curves involve the results obtained on a large number of them. To 
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get from eq. (^) the measured distribution, we must average it over the entering parameters 
a,ao,ho,'y pecuhar to any given device, with some weight P characterizing the relative 
probabihty of finding in the world a device (setup) with certain values of the parameters. 
Moreover, suppose we are dealing (as was said above) only with 'good' measurements, whose 
reported errors reflect quite adequately the dispersion of the setup. Then we can obtain the 
distribution density p{y, A) for the data with a given error A by integrating over a range of 
the parameters subject to constraint (0). Introducing slightly redefined parameters 

(62) 



dh 



b = hoy/y, c = (Ty/y 
in place of ho and 7, we can write by eqs. ( ^61) and (|58|) 

piy, = j P{(^, (^0, b, c) ^ J 



al + (^ - bf 



exp 



2{al+{^-bY) 



(63) 



where the outer integral is to be taken over three-dimensional surface of a hypersphere 

+ al + 6^ + c2 = A2 (64) 

in the space of the parameters (cr, Uo, &, c) (more accurately, the integration region covers 
1/8 part of the hypersphere where cr > 0, do > 0, c > 0). To put it differently. 



j dS-i = J dadaodbdc6{A - ^ + al + }P + c^) 



(65) 



The distribution of devices P(cr, ctq, 6, c) in eq. (|63|) is supposed to obey the normalization 
condition 

' Pdadaodbdc= I PdAdS3 = 1. (66) 



(67) 



The experimentally studied quantity is not quite p{y,A) but rather the distribution in 

p(0 = ldydA piy, A) 5(e - 1) = / A rfA pi^A, A) . 
Inserting here eq. (|63D and scaling out the variable A, 



cr = ziA, (Xq = Z2A, b = z^^A, c = Z4A, 
h = ta = tziA, dS-s = A^dn, 



we obtain 



Po(^,z) 
P(z) = 



1 

2^ 



p(0= /rffip(z)po(e,z) 

t2 



dt 



2 



Zl + (tZ4 - ^3)2 



2[z| + (tZ4-Z3)2] 



y dA A^P{ziA, Z2A, Z3A, 24A), J dVL P{z) = 1, 



(68) 

(69) 
(70) 
(71) 
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where now z is a unit four- dimensional vector and d^l an element of the surface on a unit 
hypersphere, J2i=i zf = 1. The d^l integration in eqs. (|69|) and ( fflD goes over the region 

-1 < Z3 < 1, < Zi,2,4 < 1. 

Eq. ( |69D seems to be of little use without knowing the distribution of devices P(cr, o"o, b, c) 
which is rather indefinite, just like it was with eq. (^Tj) containing an unknown function q{u, t). 
However, there turns out to be a substantial difference between these two formulae: whereas 
eq. (^) is sensitive to the form of the function q{u,t) and the details of its behaviour, it is 
quite another matter with the function P(z) in eq. ( |69[ ) where the other multiplier po{C,, z) in 
the integrand is a function of z sharply peaked at the spherical pole 2:4 = 1, zi = ^2 = ^3 = 0. 

To be more precise, consider the function po(^,z) for ^ 3> 1. Unless Z4 — > 0, the integral 



in eq. (ffOD then has two saddle points close to t = ±y^/z4 and, in perfect analogy with 
eq. dH), ^ 

Po(e,z)~e~^. (72) 

(The case of ^4 — > can be disregarded, since the integral then becomes Gaussian and 
Po(^,z) falls off rapidly as exp(— ^^/2) giving a negligible contribution into the dfl integral 
of eq. ( |5PD ). We conclude that po{C,,z) at large ^ has a sharp maximum at 2:4 = 1 and 
decreases rapidly with the departure from this pole. It means that that if we suppose P(z) 
to be more or less smooth on the sphere, the integral in eq. (|67D will be dominated by a 



small region around this pole where -P(z) can be replaced by a constant, so that the details 
of the distribution -P(z) become inessential, and we get 

p(0~e-«. (73) 

This is the result striven for: not only the distribution density p{C,) decays exponentially, but 
even with the unit coefficient in the exponent, just as it was experimentally found. True, 
the justification holds only for ^ ^ 1 and depends on some hypothesis as to the form of the 
distribution of devices P{a,ao,b,c); but this hypothesis does not seem very restrictive, and 
the asymptotic form ([73|) of p(^) may be hoped to keep approximate validity up to ^ ~ 1 as 
is often the case. 

It is worthwhile to refine eq. (|73D with the inclusion of a pre-exponential (power) factor. 



With this aim in view, let us rewrite eq. ( |68|) in the large ^ limit with more accuracy. It 



follows from eq. ([72|) that Z4, can be assumed to differ from 1 only by a quantity of order 1/^ 
or less, for beyond that region po(^,z) vanishes more rapidly than ~ exp(— ^) and can be 
put equal to zero when integrated over dfl in eq. (|67D. So we shall take 



Z4^^{zl + zl + zl)<^, (74) 



which means that zi, Z2, Z3 < ^ ^^'^ . Putting 




and expanding the exponent in the integrand of eq. (^) in r, one can check that the quadratic 
term is of order r^/^ and so r ~ Thus, keeping the terms ~ 1, we can write eq. ( ffOD 

in an approximate form: 



Po(e,z)^ — /^e 2 2tz4V ^ tz^) 
Ztx J \t\ 
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-^T^^e ^1 cosh(ye(^i-2;3)). (75) 
A cosh function arises as a result of adding together the contributions of the two saddle 



points, t = ±^^/z4. 

Inserting this expression into eq. ( [70|) and supposing P(z) to be smooth and nonvanishing 
at the point Z4 = 1, we get 

p(0 ^ -i=e-«P(o, 0,0,1) /rffie-|(^^'+^^^3)cosh(v^(^i -^3)). (76) 

For small Zi, Z2, z^ ~ the sphere can be approximated by a plane and thus d^l replaced 

by dzi dz2 dzs, so the integral in eq. ([76| ) is proportional to ^"^/^ and finally 

p(0~^e-« (e-00). (77) 

The omitted multiplicative constant includes the uncertain quantity 
P(0, 0, 0, 1) with a definite numerical factor, easily computable but not too interesting. 

It must be admitted that the assumption of -P(z) const at ^4 ^ 1 which is of im- 
portance here, is taken rather arbitrarily and it is difficult to say what the behaviour of the 
function P(z) at 2:4 — 1 should be like in reality. If, in particular, P(z) ~ (1 — z^Y or, more 
generally, is a homogeneous function of the power 2z/ in 21,^2, 2:3, then in eq. ( |77|) is 
changed by C,^"^^"- (In terms of the parameters entering eq. ([58| ) this implies the distribution 
behaving powerwise with 7 at 7 — 00). The main point however is that the exponential 
exp(— ^) is unaffected by these variations even in the case of a more rapid than power-like 
vanishing of P(z) at 2:4 = 1. 

Of some interest may be the limiting case when P(z) is concentrated near the point 
Z4 = 1, i.e. 

Piz) = 6iz,)5iz2)5{z,), (78) 

which corresponds to the situation of all devices having very large 7 in eq. (0) and <^ 7. 
Then eqs. (|^) and (|70D give 

1 /""^ dt *^ 5^ 1 
P(0 = -/ = -M0, (79) 

n Jo t TT 

where Kq is the McDonald function (modified Bessel function of the second kind). This 
expression holds for ,^ ~ 1; it fails at ^>7 because then the range of variation of -21,2,3 ~ 
becomes comparable with (or less than) the smearing of (5-functions in eq. (|78D, which by 
eqs. (|68|) and (|6l|) has the order a/A ~ 7^^/^. Asymptotic form of eq. (|79|) at ^ 3> 1 

pK)«^e-^ (80) 

is similar to eq. ( ffTl) but naturally differs in pre-exponential power of ^. 

Eq. (79) breaks down also at ^ — where -ft'o(0 has a logarithmic singularity; it ceases 
to hold when ^ ~ ^-1/2 go that again the smearing of (5-functions becomes effective (we 
suppose 2:1 ~ ^2 ~ -23 ~ 7 ""^^^j i-G- 0" ~ (To ~ /ioa/7). A real distribution density is finite at 
,^ = 0, p(0) ~ In 1/7. Thus the range of validity of eq. (|79D is 



^ « e « 7- (81) 
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Here 7 represents the minimal value of this parameter among all devices, which is supposed 
to be 3> 1; this case does not look realistic but is considered to illustrate the weak dependence 
of p(^) on the distribution of devices. 

6 Conclusion 

Main results of this work are somewhat different from those of the previous work and can 
now be summed up in brief as follows. 

First, estimates of probability of deviation of the measured from true value of some 
quantity at large ^ (>4) do not seem to make any sense at all without supplementary 
information, neither by Gaussian nor by any other similar formulae. There is no universality 
at large ^ ; typical is the existence of a critical value of ^ (dependent on the sample of data) 
beyond which the deviation probability almost ceases to fall down. This slowly dropping 
large ^ tail of a distribution is natural to associate with the contribution of the measurement 
results perverted by the overlooked systematic errors ('bad' measurements). 

Second, there is a conspicuous departure from the standard law also at smaller ^, having 
a different form and hardly attributable to the same effect which is expected to be negligible 
at small ^. Among other explanations, the effect of the error uncertainty resorted to in 
earlier papers 0, |^ does not seem to be an apt one, for its manifestations are too indefi- 
nite and unspecific, while the experimental distribution in this region shows an intriguing 
correspondence with the simple exponential ~ exp(— ,^). 

Third, a new effect has been found (the variable dispersion effect) capable of inducing 
the distribution to have asymptotic behaviour p(^) ~ exp(— ^) at large ^, with some less 
definite pre-exponential (power) factor depending on the distribution of measuring devices 
over their parameters. The effect takes into account a possibility of variation of the device 
dispersion with changing value of the measured quantity and may occur as a manifestation 
of the detected and corrected systematic errors, even in the cases of 'good' measurements. 

It should be noted that the approach applied here is merely a model, by no means 
the only possible. There can exist several other sources of deviation from the standard 
law — the error uncertainty or those mentioned in Sec. 4. In particular, the former is hardly 
believable to have no appreciable influence, since the experimenters usually do not know their 
measurement errors with much accuracy. It may happen that various effects compensate 
each other and leave alone the variable dispersion effect, but one cannot say for sure if it is 
really so. Maybe the question could be elucidated with more abundant statistics and more 
detailed study of distributions in different subsamples. It seems of interest also to study the 
analogous distributions of data obtained in other areas of science, not only in elementary 
particle physics and not only in phisics at all. The questions touched on here are of evident 
practical importance when any estimates of probability of some events are involved; one 
should have a clear notion of what the degree of validity of such estimates is like. 
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